INTRINSIC ERGODICITY VIA OBSTRUCTION 
ENTROPIES 



VAUGHN CLIMENHAGA AND DANIEL J. THOMPSON 

Abstract. Bowen showed that a continuous expansive map with spec- 
ification has a unique measure of maximal entropy. We show that the 
conclusion remains true under weaker non-uniform versions of these hy- 
potheses. To this end, we introduce the notions of obstructions to ex- 
pansivity and specification, and show that if the entropy of such ob- 
structions is smaller than the topological entropy of the map, then there 
is a unique measure of maximal entropy. 



1. Introduction 

Bowen showed that expansivity and specification imply intrinsic ergod- 
icity [Bow75j - that is, existence of a unique measure of maximal entropy. 
In |CT| . we introduced a non- uniform version of specification for a shift 
space which guarantees intrinsic ergodicity. In this paper, we establish our 
techniques in a non-symbolic setting. The key idea of |CT| is that the ob- 
structions to specification should have less entropy than the whole space. 
Here we adapt this idea to a non-symbolic setting, and replace expansivity 
(which is automatically satisfied by a shift space) with a condition which 
says that obstructions to positive expansivity should have less entropy than 
the whole space (we work with positive expansivity rather than expansivity 
for convenience only, see remark after Definition 12. 4p . 

After introducing precise definitions of h^- pec (f), which denotes the en- 
tropy of obstructions to specification, and h^ cp+ (f), which denotes the en- 
tropy of obstructions to expansivity, we establish the following result: 

Theorem A. Let X be a compact metric space and f:X—>Xa continuous 
map. If h^ pec (f) < hta P (f) and h^ p+ (f) < h top {f), then f is intrinsically 
ergodic. 

The obstruction entropies h^ pec (f) and h^ xp+ (f) are obtained as limits of 
quantities h^- pec (f, e) and (/, e), which are the entropies of obstructions 
to specification and expansivity at a fixed finite scale e. Theorem A is a con- 
sequence of the following result, in which we consider obstruction entropies 
only at suitable fixed scales. 
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Theorem B. Let X be a compact metric space and f : X — )• X a continuous 
map. Suppose e > is such that h^- pec (f,e) < h t0 p(f) and h^ xp+ (/, 28e) < 
htop(f)- Then f is intrinsically ergodic. 

The hypotheses of Theorem [B] are a priori weaker and potentially easier 
to check than the scale- free hypotheses of Theorem [A] It is unavoidable that 
we investigate expansivity at a larger scale than specification. This is due 
to changes of scale that occur in the course of the proof. In our setting, this 
cannot be eliminated by standard rescaling techniques of the type used by 
Bowen [Bow75j . (See remark at the end of Section [2.31 ) This accounts for 
the (not necessarily optimal) factor of 28 in the statement of Theorem IB! 

Roughly speaking, h^ xp+ (f,e) and h^- pec (f,e) may be understood as fol- 
lows. Positive expansivity is equivalent to having f] n>Q B n (x,e) = {x} for 
all x € X and all sufficiently small e > 0. We define h^ xp+ (f 1 e) as the 
supremum of the entropies of ergodic measures giving positive weight to the 
set of points at which this condition fails. 

Specification at scale e means that there exists a constant r such that ev- 
ery finite collection of finite orbit segments (xj, . . . , f ni Xi) can be e-shadowed 
by a single orbit which takes r iterates to transition from one segment to 
the next. We look for a collection S of orbit segments (obstructions to 
specification) such that any finite collection of finite orbit segments can be 
e-shadowed with gap size r provided we are allowed to first remove elements 
of S from the ends of each segment. We define h^^f, e) to be the infimum 
of the entropy of such a collection S. 

Our goal for this note is to establish these techniques and concepts, par- 
ticularly the use of h^ xp+ . As a first novel application of Theorem [Bj we 
can establish intrinsic ergodicity for non-symbolic factors of /3-shifts, sub- 
ject to a weak expansivity condition, extending our results from [CT] . This 
includes the following result: 

Theorem C. Every positively expansive factor of a (3-shift is intrinsically 
ergodic. 

This result suggests that arbitrary /3-shifts may have value as coding 
spaces. The coding of (algebraic) dynamical systems by /3-shifts has been 
well studied in the special case that (3 is a Pisot number, and hence the 
corresponding /3-shift is sofic (see [SchOOl ISid03[ ILS04| for an extensive list 
of references). 

2. Definitions 

Let X be a compact metric space and /:X-}Ia continuous map. We 
recall some standard definitions, and introduce the definitions of hjr pec and 
h^ x + . The notions that follow depend on the dynamics /, although our 
notation suppresses this dependence in order to simplify the presentation. 
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2.1. Entropy. 

Definition 2.1. ^4 set E C X is (n, e)-separated /or some n G N and e > 
i//or all x ^ y G E we have y £ B n (x,e), where 



and d n (x,y) = m&x{d(f i x, f l y) | < i < n — 1}. Given Z C X, we write 
A(Z,n,e) for the maximum cardinality of a (n,e)- separated set E C Z. Let 

(2.1) h(Z,e) := Inn - log A(Z, n, e), h(Z) = lim h(Z, e), 

and define h analogously. If h(Z,e) = h(Z,e) then we write h(Z,e) for the 
common value and call this the topological entropy of Z at scale e. 

Remark. In fact, h and h are the lower and upper capacity topological en- 
tropies |Pes97] . If Z is invariant then h(Z,e) = h(Z,e) for all e > |Pes97} 
Theorem 11.5]. In particular, h(X,e) exists for every s, and h(X) is the 
standard definition of topological entropy for the system (X,f). For unity 
of notation, from now on we will usually write h(X) in place of ht op (f). 

The variational principle states that h(X) = sup^ h^f), where h^(f) 
is the measure-theoretic entropy of (i and the supremum is taken over all 
Borel /-invariant probability measures on X [Wal82 . A measure achieving 
the supremum is a measure of maximal entropy (MME) , and a system with 
a unique MME is called intrinsically ergodic [CU IWei701 IHof791 IHof811 



We will have occasion to consider the entropy of a sequence of sets Z n . 
Such a sequence generates a subset T> C X x N in a natural way, as T> = 
\J n Z n x {n}. Conversely, given V C X x N we obtain a sequence of sets 
V n = {x G X | (x,n) G V}. We think of subsets PcIxNas collections 
of points and times, so that given x G X, the set {n G N | (x, n) G V} can 
be thought of as those times at which the orbit segment (x, f(x), . . . , f n {x)) 
satisfies a certain property. 

Definition 2.2. Give V C X x N, write A{T>,n,e) = A(V n ,n,e). The 
(upper) topological entropy ofT> is 

(2.2) hCD,e) := Tim ~ log A(£>, n, e), h(V) = lim h(T>, e). 

n— KX) 77, e— >0 

This procedure of taking the capacity entropy of a sequence of sets appears 
in the definition of coarse multifractal spectra, and a general discussion of 
the relationship between this quantity and other dimensional quantities is 
given in |Clil3| . 

2.2. Expansivity. Positive expansivity at scale e is equivalent to the fol- 
lowing property: for every x G X, we have 



B n (x,e) = {y G X \ d n (x,y) < e} 



IBuz97| . 



(2.3) 
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Definition 2.3. Denote by M + (e) := {x G X | $+(e) / {x}} the set of 
non-expansive points at scale e. Following Buzzi and Fisher [BFllJ, we say 
that an f -invariant measure is almost positively expansive at scale e if 
n(J\f + (e)) = 0; in other words, z/<&+(e) = {x} for [i-a.e. x. 

Definition 2.4. Writing A4 e j for the set of ergodic f -invariant Borel prob- 
ability measures on X, the entropy of obstructions to positive expansivity 
at scale e is 

h^ xp+ (f,e) = sup{/i M (/) | \x G M e j is not almost positively expansive] 
= sup{/i M (/) | n G M e f and fi(J\f + (e)) > 0}. 

We also define h^ xp+ (/) = lim e ^. h^ xp+ (/, e) . The limit exists since h^ xp+ (/, e) 
is monotonic when considered as a function of e. 

Remark. Positive expansivity is the appropriate definition for non-invertible 
maps. For invertible maps, Buzzi and Fisher define almost expansivity using 
^xif) '■= f]n>o f n (^2n{f n x, e)) and requiring that \i give zero measure to 
the non-expansive set M(e) := {x G X \ $> x (e) ^ {x}}- Thus, when / is 
an expansive homeomorphism on X, every /-invariant measure is almost- 
expansive. We can define the entropy of obstructions to expansivity at scale 
e for a homeomorphism / as 

h eX9 [e) = sup{/i M (/) | fjL is /-invariant and not almost expansive} 
= sup{/i M (/) | fx is /-invariant and n{N{e)) > 0}. 

We do not consider the invertible case or h^ xp in this note. This is purely 
for convenience. Our proofs go through with minimal modification in the 
situation when / is invertible and /i ex p is less than the entropy of the space. 

The following proposition gives a useful method for bounding + ■ 
Proposition 2.1. h^ xp+ (f , e) < h(M + (e)) . 

Proof. Given h < h^ xp+ (f,e), let \x G M e j be such that /i M (/) > h and 
/i(A/" + (e)) > 0. By the Brin-Katok local entropy formula [BK83], there 
exists N G N and Y C X such that fj,(Y) > l-[i(N + (e)) and fj,(B n (x,e)) < 
e -nh for ajj n > N _ Taking Z = YnM + (e), we get fx(Z) > 0. 

Let E n C Z be any maximal (n, e)-separated set. By maximality, we have 
Z C \J xeEn B n (x,e), whence n(Z) < (#E n )e~ nh . Thus #E n > u{Z)e nh , 
and it follows that h(N + {e)) > h(Z,e) > h. Since h < h^ xp+ (f,e) was 
arbitrary, we are done. □ 

The following two propositions are crucial to our approach. 

Proposition 2.2. If fj, is almost positively expansive at scale e and A is 
a measurable partition of X such that every element of A is contained in 
B(x,e) for some x G X, then A is (one-sided) generating for \i. 
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Proposition 2.3. Ifh^ p+ (f,e) < h(X), then h(X,e) = h(X). 

Proposition 12.21 is immediate from the definitions. Before proving Propo- 
sition [2T3l we introduce a useful concept. 

Definition 2.5. Let E n C X be an (n,e) -separated set of maximum cardi- 
nality. We say that a partition A n is adapted to E n if A n is a partition such 
that each atom w of the partition satisfies 

B n (x, |j C w C B n (x,e) 

for some x E E n . 

The existence of adapted partitions for any E n follows from the fact that 
the sets B n (x,e/2) are disjoint and the sets B n {x,e) cover X. For every 
n > 1, the non-expansive set for the map f n with respect to the Bowen 
metric d n = maxo<fc< n do f k coincides with the non-expansive set for / with 
respect to the original metric d. Thus, Proposition 12.21 shows that if \x is 
almost positively expansive at scale e and A n is an adapted partition for a 
maximal (re, e)-separated set, then A n is generating for \i under the map f n . 

Proof of Proposition \2.SX For each n > 1, let E n be a maximal (re, e)-separated 
set, and A n an adapted partition for E n . Fix u G Aij with h^(f) > 
+(/, e), noting that such \x exist by the variational principle. By the 
above remarks, A n is generating for \x under the map f n . In particular, we 
have 

M/) = ~KU n ) = -KU n ,A n ) <-\og#A n = -log#E n , 
n n n n 

and sending n — > oo gives h^(f) < h(X,e). Taking a supremum over all 

such fi and applying the variational principle gives the result. □ 

We will see in $33] that Proposition 12.31 implies the existence of an MME. 

2.3. Specification. 

Definition 2.6. A collection of points and times Q C XxN has specification 
at scale e if there exists r € N such that for every {(xj,nj) | < j < k} C Q, 

k 

(2.4) f|r^o 1 K+-) jB% .( Xj , e )^0. 

j=0 

If the intersection always contains a periodic point with period Yli=o( n i + r )> 
we say that Q has (Per)-specification at scale e. 

Remark. There are two standard definitions of specification for the dynam- 
ical system (X, /), which differ from each other only in whether or not one 
requires that the shadowing orbit be periodic. Using our terminology, these 
properties are equivalent to the requirement that the entire set X x N has 
specification, or (Per)-specification, for every e > 0. 
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Definition 2.7. A triple (V, Q,S) C (IxN) 3 is a decomposition for (X, f) 
if for every x € X and n 6 N there exist p,g,s 6 N with p + g + s = n so 
that 

(x,p)€P, (f p x,g)eg, (f p+9 x,s)eS. 

That is, every orbit segment can be decomposed into a 'prefix' from V , a 
'good' core from Q, and a 'suffix' from S. Given a decomposition (V,G,S), 
we will also need to consider the following collections of points and times: 

g M ■- {( x , n) £ X x N | p{x, n) < M, s(x, n) < M}. 

Remark. Note that X x N = {J M G M , so that a decomposition (V,Q,S) 
defines a filtration of the collection of all points and times. We are interested 
in the situation when each level Q M of this filtration has specification. In 
the course of the proof we will see that every MME fi has the property that 
inf n ^{Gn 1 ) -> 1 as M -> oo. This suggests that the collections Q M can 
be thought of as regular sets analogous to the Pesin sets of non-uniform 
hyperbolicity theory. 

Definition 2.8. The entropy of obstructions to specification at scale e is 

^ec(/> £ ) : = inf U S, 3e) I V,S C X x N are such that 
there exists a decomposition (V,G,S) for (X,f) 

for which every Q M has specification at scale e}. 

We define h^p ec (f,e) = lim £ _^o h^ pec (f,e). The limit exists since /i^,ec(/; e ) 
is monotonic when considered as a function of e. 

Remark. A priori, Q could have specification at scale e, but not at a smaller 
scale 5. Thus, h^ ec (f,e) may increase as e decreases, which is why the 
hypotheses in Theorem [Blare a priori weaker than those in Theorem [Al This 
contrasts with the classical setting of Bowen, where specification at a fixed 
scale and expansivity at that same scale together imply the specification 
property at all smaller scales. This can be shown by using expansivity and 
compactness to find N = N(5,e) such that B^{x,e) C B(x,5) for every x, 
so that specification at scale e with gap size r implies specification at scale 
8 with gap size r + N(5,e). This argument does not work in our setting, 
because although such an N = N(x, 5, e) exists for each x M + {e), it cannot 
be chosen uniformly. 

3. Proofs 

Throughout this section, (X, d) is a compact metric space and / : X — ^ X 
is continuous. Large parts of the proof work without the full strength of the 
hypotheses of Theorem ITU so for each partial result we state precisely which 
conditions are needed. Our method is inspired by Bowen's original proof 
from [Bow75], incorporating the modifications from [CT| that allow us to 
use the weakened version of specification. 
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We outline the strategy of the proof. First, we obtain a series of com- 
binatorial estimates which yield bounds on the maximum cardinalities of 
various (n, 5)-separated sets (Lemmas I3.lti3.8p . These estimates allow us to 
construct a measure of maximal entropy with a certain weak Gibbs prop- 
erty (Lemma 13. 10H . which we show is ergodic (Proposition 13.131) . We use 
the weak Gibbs property, together with a crucial combinatorial estimate on 
positive measure sets (Lemma I3.12H , to rule out the existence of any other 
measure of maximal entropy. 

3.1. Lower bounds on X. We obtain a lower bound on the maximum 
cardinality of (n, <5)-separated sets for X using a completely general, and 
rather standard, argument which is essentially contained in Bowen |Bow75j . 

Lemma 3.1. A(X, Y2j=i n j, 25) < Y\j = i A(X, rij, 5) for every 5 > and rij. 

Proof. Let E be maximal (^ rij, 2<5)-separated and let Ej be maximal (rij, 6)- 
separated. Then Ej is (rij, <5)-spanning, and we define a map ir: E 
Ei x • • • x Ek by the condition that f n i+-+ n j-i ( x ) g 5 (nj(x), 6) for each 
j. This map is injective. □ 

Lemma 3.2. For every n G N and 5 > 0, we have A(X,n,5) > e nh ( x , 2S ) . 

Proof By Lemma [3TT1 we have A(X, kn, 25) < (A(X, n, 5)) k . It follows that 
logA(X, n, 8) > n(-j^logA(X,kn,25)), and we send k — > 00. □ 

3.2. Upper bounds on Q. Using a standard argument based on the spec- 
ification property of Q, we obtain upper bounds on the cardinality of an 
(n, <5)-separated set in Q n . 

Lemma 3.3. IfQcXxN has specification at scale 5 with gap size r, then 

k 

A(X, ni + ■■■ + n k + (k- l)r, 5) > JJ A(G, rij, 36) 

3=1 

for every m, . . . ,n k . 

Proof. Let Ej C Q nj be a (rij , 35)-separated set of maximum cardinality, 
and use the specification property to define a map tt : E\ X • • • E% — > X such 
that ir(xi, . . . , Xk) is contained in the intersection from (|2.4p . A computation 
similar to the proof of Lemma 13 . 1 1 shows that the image of tt is (n\ + • • • + 
nk + (k — l)r, <5)-separated. □ 

Lemma 3.4. If Q C X x N has specification at scale 5 with gap size t, then 
A(g,n,35) < e (^r)h{x,5) j or every n G N . 

Proof. Every (m, 5)-separated set is (n, <5)-separated for n > m. Thus 
Lemma [3.31 implies A(X, k(n + r), 5) > (A(Q, n, 35)) k , and the result fol- 
lows by sending k — > 00 in the resulting inequality log A(Q, n, 35) < (n + 
T )( 1 ^ ) \ogA(X,k(n + T),5)). □ 
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3.3. Lower bounds on Q and related sets. First, we prove the following 
useful summability result. 

Lemma 3.5. // V, S C X x N are such that h{V US, 5) < h(X,5), then 
J2 i>M H'P U S, i, S') e - ih( - x ^ -> as M -»• oo for every 6' > 6. 

Proof. Write h := h(X, 5) and let 7 > be such that h(V US, 5) <h - 2j. 
Then there exists a constant C > such that A(V U S, n, 5) < Ce n<yh ~^ for 
all n, and so 

e~ nh A{V U S, n, 5') < e~ nh A{V U S, n, 5) < C e~ nh e n( - h ~^ = Ce~ n \ 

Thus Y.n=o e ~ nhk ( V US,n,5') < CXr=o( e " 7 ) n < °°> and the tail of the 
series converges to 0. □ 

The following result holds for decompositions where Q has specification, 
without any conditions on Q M or on expansivity. The result should be 
interpreted as a lower bound on the growth rate of a 'fattened up' version 

of g. 

Lemma 3.6. Let {V,Q,S) be a decomposition for (X,f) such that 

(1) Q has specification at scale 5 with gluing time r, and 

(2) h(VuS,5) < h(X,5). 

Then for every 71, 72 > there exists M € N such that the following is true: 
For any V C X x N such that A(V,n,Q5) > ^ 1 e nh( - x ^ for all n, we have 
A{V n G M , n, 65) > (1 - 72)A(X>, n, 65) for all n. 

Proof. Let D n cPbea (n, 65)-separated set of maximum cardinality, and 
given i,j, k G N, let 

D(i, j,k) = {x £ D n \n = i+ j + k, p(x, n) = i, g(x, n) = j, s(x, n) = k}, 

where p,g,s are as in Definition 12.71 Observe that \J{D(i, j,k) j i,k < 
M} cPfl Q M for every M, and in particular 

Dn ■= [J{D(i,j, k)\i + j + k = n,i,k<M} 

is an (n, 6<5)-separated subset of V n Q M , so A(V n Q M , n, 65) > #D?f . 

Let Ef C Vi be maximal (i, 35)-separated, and similarly for E*? C Gj 
and E^ C Sk- As in Lemma 13.11 there is an injection tt: D(i,j,k) — > 
Ef x x E^, and we obtain 

A{V,n,65)< #D(i,j,k)+ £ A{V, i, 3S)A(G, j, 3<5)A(5, k, 3d). 

i+j+k=n i+j+k=n 
i,k<M iVk>M 

The first sum is equal to jfD^. By Condition ([T]), Lemma I3~4l gives 
A(V, n, 65) < A(V n G M , n, 65) + ^ A(V, i, 3<5)A(5, k, 35)e^ +T ^ x ^ . 

i+j+k=n 
iVk>M 
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Write o» = A(V U S, i, 35)e- ih ( X ' 5 '> and observe that by Condition ($) and 
Lemma ELS we have bu '■= J2i>M a i — as M — )• oo. Together with the 
condition on T>, this yields 

A(V,n,6S) <A(Vng M ,n,6S) + e {n+T)h{X ' S) ^ ai a k 

iVk>AI 

< A(V n G M ,n,65) + ^ 1 A(V,n,65)e Th( - X ' 5 h 2 M . 

Choosing M such that ^ 1 e rh ^ X '^b 2 M < 72, the proof is complete. □ 

Remark. If we assume the hypotheses of TheoremlB| then we have h^, ec (f, 5) < 
htop(f) = h t0 p(f,S) for every 5 € [e, 28e], where the equality uses Proposi- 
tion [2T3J This gives the existence of a decomposition (V, G,S) for (X, f) so 
that Q M has specification for each M, and h(VUS,5) < h(X,S). Thus, 
the results of Lemmas 13.51 and 13.61 apply to this decomposition. This is 
the only place in the proof of Theorem [B] where we use the assumption 

3.4. Upper bounds on A(X, n, 65). From now on, we assume that h^y ec (f, 5) < 
h(X,5). Thus, there is a decomposition (V,G,S) for (X,f) satisfying 

(1) each G M has specification at scale 5 with gluing time tm, and 

(2) h(VUS,5) < h(X,5). 

In this section, we also assume that 

(3.1) h(X,126) = h(X,5), 

which, by Proposition 12.31 is a weaker assumption than h + (/, 125) < 
h(X). 

Lemma 3.7. Under the above conditions, for every 7 > there exists M 
such that A(G M ,n, 65) > (1 - j)A(X, n, 65) for all n. 

Proof. Lemma 13.21 together with (|3.ip gives A(X,n,65) > e nh ( X ' S ', so we 
can apply Lemma 13.61 □ 

Lemma 3.8. There exists C\ G R such that e nh ^ < A(X, n, 65) < de nh ^ 
for all n. 

Proof. As above, the first inequality follows from Lemma [3. 2 1 and (|3.1|) . The 
second inequality follows by applying Lemma 13.71 with 7=5 an d then 
applying Lemma 13.41 to Q M , so that 

A(X,n,65) < 2A(G M ,n,65) < ^ e ™ h( - x ^)e nh ^ s \ □ 

3.5. An MME with a Gibbs property. We replace (|3.ip with the stronger 
assumption that for sufficiently small 5 > 0, 

(3.2) h(X,5) = h(X). 



10 



VAUGHN CLIMENHAGA AND DANIEL J. THOMPSON 



This ensures the existence of an MME, constructed using the following stan- 
dard argument. Let E n C X be a maximal (n, <5)-separated set, and take 

1 \ - 



In order to obtain invariant measures, we consider the measures 

n-l 



(3.3) ^ : =l£/, 

71 * 



n A — ' 

k=0 



and let \i be a weak* limit of the sequence {/^ n }- 
Lemma 3.9. \i is a measure of maximal entropy. 

Proof. The second part of [Wal82l Theorem 8.6] shows that h^(f) > h{X, 5). 
Thus, by (|3.2p . if 5 is sufficiently small, /i is an MME. □ 



So far we have used 5 to denote the scale at which we work, since the 
results will be used for various values of 5. From now on, we fix the scale e 
as in Theorem [Bj We construct \i as above using 5 = 6e in the selection of 
the sets E n . By Prop osition 1 2 . 3 1 and the assumption h^ xp+ (f, 28e) < h t0 p(f), 
(|3.2p is satisfied for this value of 8, so \i is indeed an MME. 

We now also assume that h^p ec (f,e) < fh op (f), and show that \i satisfies 
a certain weak Gibbs property. 

Lemma 3.10. For sufficiently large M, there exists Km > such that for 
every (x,n) £ Q M , we have 



(3.4) /i(B n (x,7e)) > K M e 



-nh(X) 



Proof. Let M be large enough to satisfy the conclusion of Lemma 13.71 with 
7 = 5- Then applying Lemma 13.21 with 5 = =£e, we have A(Q M ,n, 14e) > 

l e nh(X) for all ^ and SQ there exigts E G c qM guch that ^l^G > l e nh(X) 

and is (n, 14e)-separated. 

To estimate /j,(B n (x,7e)), we first estimate u rn (f~ k B n (x,7e)) for m 3> n 
and (most values of) < A; < m. Let r = tm G N be provided by the 
specification property. Fix t < k < m — n — r; let l\ = k — r and li = 
m — k — t — n, so that £\ + r + n + r + £2 = rn. 

Using specification, for each y € E^ and z E there exists 

<p(y, z) G (y, e) n r^ T B n (x, e) n f^-^Be, (z, e); 

furthermore, because £yf is (n, 14e)-separated, tp is injective and tp{Ef^ X-Ef) 
is (m, 12e)-separated. 

Because the set E m used to construct \x is (m, 6e)-spanning, we can define 
tp: tp{E^ x Ef 2 ) — >• such that d m (w,ip(w)) < 6e, and hence ipoip: Ef x 
E 1 ? — > S m is injective. Furthermore, ip(y,z) € f~ k B n (x,e) for all y, and 
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hence ip o (p(y, z) G / k B n (x, 7e). Therefore 

(#Ef)(#Ef) e (ti+t2)h{X) 

, ra (r^(x,7 £ ))>WA^>i__, 



where the bound on the denominator comes from Lemma 13.81 This holds 
for all t < k < m — n — r, and averaging over < k < m gives 

» m (B n (x,7e)) > f m ~T 2T ) e^W). 

Sending to — oo gives the result with if&f = (4C\)~ 1 e~ 2TMh ( x \ □ 

Later on, in the proof of ergodicity, we will need a very similar lemma that 
estimates the measure of a set of points whose trajectories are prescribed 
not on a single interval, but on two disjoint intervals. 

Lemma 3.11. For sufficiently large M, there exists K' M > such that for 
every (x\,ni), (x 2 ,n 2 ) € Q M and q > 2tm, we have 

/i (S m (xi,7e) n r ni ~ q B n2 (x 2 ,7e)) > K' M e~^ +n ^ x \ 

Proof. The proof closely resembles that of Lemma 13. 101 Let M be large 
enough to satisfy the conclusion of Lemma [3T71 with 1 = \ and let r = tm- 
As before, let E% C Qjf be (n, 14e)-separated with > \e nhi ~ x \ 

Given to ^> n\ + n 2 + q and r < k < m — n 2 — q — n\ — r, let i\ = k — r 
and t 2 = m — n 2 — q — n\ — r — k, so that 

m = £i+T + ni + T + q + T + n 2 + T + £ 2 , 

where q = q — 2r. A similar argument to the one in Lemma [3.101 shows that 



v m {f- k B ni (xi,7e) n r k - n ^B n2 {x 2 , 7e)) > 



A(X, to, 6e) 

> 



sc ie mh ( x ) ' 

Averaging over < k < m gives 

Hm(r k B ni (x u 7e) n r k ~"B n2 (x 2 , 7e)) 

m-n 1 -q-n 1 -2r\ _ (4r+ni+n2)/l{x) 



^ 1 8C^ ! ' 

and sending m — >■ oo gives the result with ET^- = (8Ci) _1 e _4rM,, '( J,< '). □ 

3.6. Adapted partitions and positive measure sets for MMEs. From 
now on we need the full strength of the hypotheses in Theorem |Bj In partic- 
ular, we assume that every MME v is almost positively expansive at scale 
28e. We recall that the definition of an adapted partition is given in §2.21 
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Lemma 3.12. For every 7 E (0, 1) there exists C 7 > such that if v is 
an MME and V C X x N satisfies v{T> n ) > 7 for every n, then T> n has 
non-empty intersection with at least C 7 e n/l ( x ) atoms of A n for any partition 
A n adapted to a maximal (n, 14e) -separated set. 

Proof. Because v is almost positively expansive, we see that for i^-a.e. x € X 
the following holds: for all y 7^ x there exists m such that d(f m y, f m x) > 
28e. Writing m = qn+r for q, r £ N, r < n, we see that d n ((f n ) q y, (f n ) q x) > 
28e, and it follows that A n is generating for f n . In particular, this implies 
that 

= /!„(/", 4 n ) = inf -H u (A n V (/ n )~U B V • • • V {f n )- q+1 A n ) 

9>1 <? 



<i^(„4 n ) = ^ -1/(10) log 1/(11;). 



Without loss of generality, we assume that V n is a union of atoms of A n , 
and write this collection of atoms as W n . Writing = A n \ W n and 
T>n = X \ V n , we recall that h u (f n ) = nh u (f) = nh(X) and obtain 

nh(X) < ~ v(w) log v(w) + ~ v(w) log v(w). 

wew n mews 

Normalising each sum yields 

\wew n v n; 



(3.5) 





+ (-«/(A0 log ^(P n ) - !/(2^) log 1/(2^)). 
Recall that for any non-negative numbers ai, . . . a& summing to 1, we have 

^-ajlogaj < log A;. 

i=l 

Applying this to the first sum in (|3.5p with the quantities cij replaced by 
^=ry, and to the second sum with replaced by ^^§j, we see that 

nh{X) < u(V n ) log #W n + */(££) log #(W c n ) + H(u(V n )), 

where we write H(t) = —tlogt — (1 — t) log(l — t). Lemma 13.81 implies that 
#(W£) < #5 n < C ie nh ( x ), and so writing C 2 = logCi +max tg[0jl] #(£), we 
have 

nh(X) < u(V n )log#W n + (1 - i/(X> n ))(IogCi + nh(X)) + H(u(V n )) 
= u(V n ) log #W„ + nfcpQ - i/(P n )(log Ci + + C 2 , 
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which gives 

u(V n ) log #W„ > u(V n )(log d + nh(X)) - C 2 , 

log #W„ > log d + nh(X) - Q 

7 

Thus it suffices to take C 1 = Cxe~° 2h ■ □ 

3.7. Ergodicity. Now we consider the MME constructed in £ ]3,5l and show 
that it is ergodic. This is a consequence of the following proposition. 

Proposition 3.13. If two measurable sets P,Q C X both have positive 
^-measure, then lim ri ^^ //(P n a~ n (Q)) > 0. 

Proof. In fact, we show that for every 7 > there exists a > such that if 
min{/x(P),)u(Q)} > 27, then 

lim n(Pna- n (Q)) > a. 

As in £ )3.6t we take a sequence of partitions A n , each of which is adapted to 
a maximal (n, 14e)-separated set S n . Recall that each x € S n is identified 
with a partition element w x G „4 n such that 

5 n (x,7e) C lOs. 

Fix (5 € (0,7). Because $ x (28e) = {x} for /i-a.e. x G X, we can assume 
WLOG that this holds for every x £ P,Q. Thus f|„ B n (x, 14e) = {x} for all 
xeP,Q. 

In what follows, to simplify notation, we sometimes use the same symbol 
(such as U) to denote both a set of partition elements (U C A n ) and the 
union of those partition elements (U C X). 

Lemma 3.14. Let fj, be a finite Borel measure on a compact metric space 
X and let P C X be measurable. Suppose A n is a sequence of partitions 
such that P| n ^4 n (x) = {x} for every x G P, and let 5 > 0. Then for all 
sufficiently large n there exists a collection U C A n such that n(U AP) < S. 

Proof. This is a standard argument. Let R C P and S C X \ P be compact 
such that fi(X \(RUS)) < 5/2. Now let 77 > be such that d(x, y) > rj for 
all x G R and y G 5. For all sufficiently large n there exists R' C R such 
that \ R') < 5/2 and diam^4"(x) < 77 for all 2 G P'. 

Let f/ = {wei n |«)nfi' / 0}. Then R' C C X \ S, and the result 
follows since U A P C (X \ (RU S)) U (R \ R'). □ 

Returning to the proof of Proposition 13. 13] we see that Lemma f3. 141 allows 
us to choose, for all sufficiently large n, collections U, V C A n such that 

H(U A P) < 5 and fi(V A Q) < 5. 

In particular fJ>(U) > /j(P) — 5 > 7, and /i(V) > /i(Q) — 5 > 7. Thus, by 
Lemma 13.121 we have 

#U > C 7 e" h(x) and #V > C 7 e n?i(x) . 
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Let M be such that Lemma 13.61 holds with 71 = C 7 and 72 = \; write 
U' = {w x G U I x G £7 } and similarly for V'. Then 

(3.6) #[/' > ~C 7 e n/l W and #V > ^C 7 e nh ( x \ 

For every u> xi G U' and w X2 G V' and all m > 2r + n, Lemma 13.111 implies 
that 

/<K n /- m ^ 2 ) > ^(B n (x a ,7e) n f- m B n (x 2 ,7e)) > K' M e~ 2nh ^ . 
It follows that 

/x(C/' n r m V) > (±C*e 2nh ( x A (K' M e- 2nh ^) = =: a. 

Now since U D U' and V D V, and since for any m, 

\H(U n a~ m V) - n(P n a- m Q)| < 6, 

we have 

lim n(P n a~ m Q) > lim n(U n cr^F) - <5 > a - 5. 

Furthermore, since (5 > was arbitrary, we conclude that 

lim fi(P n a~ m Q) >a. □ 

m— >oo 

3.8. Contradiction if there is another mme. Let fi be the ergodic MME 
constructed in the previous sections, and suppose that there exists another 
ergodic MME v _L fj,. Let P C X be such that fi(P) = and v{P) = 1, and 
let A n be a sequence of partitions adapted to maximal (n, 14e)-separated sets 
S n . By Lemma EH1 there exists U n C A n such that (// + v){U n AP)^0. 
In particular, we have v(U n ) 1 an d n(U n ) — > 0. 

Now 7 := inL/i v(U n ) > 0, and so by Lemma 13.121 we have #{/ n > 
q nh(X) £ or a rj n _ ^ g m p roo f f Proposition 13.131 fix M and let 
Un = { w x G U n I x G <7 M }; by Lemma 13.61 there exists M such that 
> \C^e nh ^ for all n. 

Finally, we use the Gibbs property in Lemma 13.101 to observe that 

KUn) > KK) > (#K)K M e~ nh W > \c^K M > 0, 

which contradicts the fact that fJ,(U n ) — > 0. This contradiction implies that 
any MME v is absolutely continuous with respect to fj,, and since \x is ergodic, 
this in turn implies that v = fx, which completes the proof of the theorem. 

4. Application to factors 

We prove that every positively expansive topological factor of any (3- 
shift has a unique measure of maximal entropy. The key point is that a 
decomposition (V, Q,S) for a system (X,f) induces a decomposition for a 
factor (Y, g) in a natural way. We demonstrate this phenomenon, which was 
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worked out in the symbolic case in [CT] . in the special case of factors of the 
/3-shift. 

For a more detailed description of the structure of the /3-shift (X, a) , we 
refer to |CT] and the references therein; here we only state the properties 
we need for the proof. The key is that there exists a distinguished sequence 
w £ S + := {0, 1, . . . , 6} N (the /3-expansion of 1) such that z £ S + is in X if 
and only if a n z ^ w for all n > 0, where ^ denotes the lexicographic order. 

The language C of the /3-shift is the collection of all finite words that 
appear in sequences z £ X. We recall the decomposition of C given in [CT]: 
C s = {w\ ■ ■ ■ w n | n > 1} is the collection of prefixes of w, and Q is the 
collection of words in C that do not end with any prefix of w. That is, 

Q = {vi . . . v m £ C | ^ w\ ■ ■ ■ w n for all 1 < n < m}. 

Every word v £ £ can be decomposed as v = v% ■ ■ ■ v m w% ■ ■ ■ w n , where 
v\ - ■ ■ v m 6 Q. We will use the following properties of words in Q, which 
are easily seen from the order-theoretic characterisation above or from the 
graph presentation of X described in [CT]. 

(1) Given any d££, the word v0 k is in Q for all sufficiently large k. 

(2) For any u £ Q and any v £ C, the word uv £ C. 

Let (Y, g) be a topological factor of (X,a), and let ir: X •->• Y" be the 
factor map. 

Lemma 4.1. (Y, g) /ias positive topological entropy or Y is a single point. 

Proof. Assume that Y is not a single point. Then there exists x,y,5o so 
that d(x, y) = 35o (where d is the metric on Y). By uniform continuity of it, 
there exists 5 so that ir(B(z,5)) C S(7rz,^o) f° r ah ^ £ Fix x £ tt^ 1 (x) 
and y £ TT~ 1 (y). 

For sufficiently large n, there are words v\,V2 £ Q of length n such that 
[v\\ C B(x,8) and [t^] C B(y,5). To see this, take prefixes of x and y with 
sufficient length that the first condition holds, then append the symbol to 
these prefixes enough times that the resulting words have the same length 
and are both in Q, using property (P). 

Now, for any i = %\ . . .i m £ {1, 2} m , we can choose y» £ [vi 1 v j 2 . . . Vi m ] 
using property ([2]), and define Xi = iryi. The points {xj} form a (nm, ^o)- 
separated set in Y. Thus A(Y, nm, 5o) > 2 m for every m. This shows that 
the topological entropy of (Y, g) is positive. □ 

Proposition 4.2. /i^ cc (/, e) = /or euery e > 0. 

Proof. We obtain a decomposition (V,Q,S) for (Y, g) from the decomposi- 
tion C = QC S , then show that Q M has specification for every e > 0. (Of 
course the transition time r depends on e and M.) let 

Q = {(x, n) | x = irx for some x £ X such that x\ . . . x n £ t/}, 

5 = {(x, n) | x = 7tx for some x £ X such that xi . . . x n £ C s }. 

Setting V = 0, this defines a decomposition for Y. 
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Lemma 4.3. Q M has the specification property for all e > and M G N. 

Proof. Fix e and take 5 so that ir(B(x,d)) C B(irx,e) for all x £ X. There 
exists m = m(5) so that for all y G X, we have [yi ■ ■ ■ y m ] G B(y, 5). Then 
for every z G [y"i • • -ym+nL we have d n (y,z) < 5, and so for y = iry and 
z = vrz we have d n (y,z) < e, where d n (y,z) := max < fc < n d(g n (y), g n {z)) is 
the Bowen metric. 

Given any collection {(x 1 ,ni), . . . , (x fc ,nfc)} C let x* G vr _1 Xj be such 
that x\ ■ ■ ■ x l n . +m G . Since Q M has specification with gap length tm, we 
can find connecting words v 1 of length tm such that the cylinder 

r~l -1 1-2 ~2 2 ~fc ~1 i 

[■^1 • • • x ni+m v x l • • • x ri2+m v • • • X X ■ ■ • x n k +m\ 

is non-empty. In particular, any point z in the image of this cylinder under n 
verifies the specification property for the collection {(x 1 ,^), . . . , (x fc ,nfc)}, 
where the gap length is m + tm ■ □ 

Lemma 4.4. h(S) = 0. 

Proof. Let w = w\W2 ■ ■ ■ be the ^-expansion of 1, and let z = ir(w). Then, 
by definition, 

S C {(x, n) | X = 7TX, xi . . . x n = wi . . . w n }. 

Take 5 so that ir(B(x,5)) C B(irx,e) for all x G X. There exists so that 
if y G 5 n+ fc, then there exists y G vr~ 1 (y) Pi B n (w, 5), and thus y G B n (z, e). 
Thus, for all n, S n+ k C B n (z, e). In particular, A(<S n+ fc, n+fc, e) < A(X, k, e), 
and h(S,e) < linin^oo i log A(X, A;, e) = 0. Since e was arbitrary, we are 
done. □ 

Proposition 14.21 follows from Lemmas 14.31 and 14.41 □ 

Using Proposition 14.21 we can apply Theorem [B] to (Y, g) as long as 
^exp+ O^' e ) ^ ^(^) f° r some e - I n particular, we can apply Theorem IB1 
when (Y,g) is positively expansive. 
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